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Tetrapyrroles, including haem and chlorophyll, play vital roles for 
various biological processes, such as respiration and photosyn- 
thesis, and their biosynthesis is critical for virtually all organ- 
isms. In photosynthetic organisms, magnesium chelatase 
(MgCh) catalyses insertion of magnesium into the centre of 
protoporphyrin IX, the branch-point precursor for both haem 
and chlorophyll, leading tetrapyrrole biosynthesis into the 
magnesium branch’. This reaction needs a cooperated action 
of the three subunits of MgCh: the catalytic subunit ChIH and 
two AAA* subunits, Chil and ChID (refs 3-5). To date, the 
mechanism of MgCh awaits further elucidation due to a lack of 
high-resolution structures, especially for the ~150 kDa catalytic 
subunit. Here we report the crystal structure of ChIH from the 
photosynthetic cyanobacterium Synechocystis PCC 6803, 
solved at 2.5A resolution. The active site is buried deeply 
inside the protein interior, and the surrounding residues are 
conserved throughout evolution. This structure helps to explain 
the loss of function reported for the cch and gun5 mutations of 
the ChIH subunit, and to provide the molecular basis of substrate 
channelling during the magnesium-chelating process. The 
structure advances our understanding of the holoenzyme of 
MgCh, a metal chelating enzyme other than ferrochelatase. 

Chlorophyll, the most abundant pigment in plants, algae and 
cyanobacteria, is synthesized through a multistep pathway in 
which protoporphyrin IX (Proto) is the branch-point precursor 
for both haem and chlorophyll. In contrast to the single-subunit 
ATP-independent ferrochelatase that catalyses insertion of a 
ferrous iron into the Proto ring®, MgCh comprises three subunits, 
ChlH, ChlI and ChID, and requires ATP for magnesium chelation. 
To complete a catalytic cycle, the three subunits cooperate in a 
dynamic manner with the AAA* subunits ChlI and ChID assem- 
bling into a two-tiered hexameric ring, and then interacting with 
the substrate-binding ChlH subunit to form a transient holoenzyme 
complex’, ATP hydrolysis by ChlI drives magnesium insertion and 
disassembly of the holoenzyme®”. To date, the structure of the hex- 
americ ChlI-ChID motor unit has been observed by electron 
microscopy!*", and the structure of the monomeric ChlH (or 
BchH, the bacterial homologue of ChlH) has been observed by elec- 
tron microscopy at 25-30 A resolution!*. 

Here we describe the 2.5 A structure of a full-length ChlH from 
cyanobacterium Synechocystis sp. PCC 6803 (Table 1). The overall 
architecture with dimensions of 133 A x 80 Å x 75 A is a cage-like 
assembly preceded by a small amino (N)-terminal region (Fig. 1), 
which is consistent with electron microscopy studies and small- 
angle X-ray scattering analysis'*!>. The crystal structure shows 
that ChIH is composed of six domains (I-VI), with domains 


II-VI constituting the cage-like assembly; domains I and II form 
the N-terminal ‘head’ and ‘neck’ regions. Domains I and II have a 
B-sheet core similar to the carboxy (C)-terminal domain of transke- 
tolase’®, with five B-strands arranged in the order of 1-3-2-4-5 
(Supplementary Fig. 1). Domain III exhibits a previously unre- 
ported fold that consists of a nine-stranded (2-3-4-1-5-6-7-9-8) 
B-sheet and seven main a-helices (al-a7). A large a-helical bundle 
is inserted between B7 and a7 of domain III, and this insertion con- 
stitutes domain IV. Domain V also adopts a distinct fold, with a seven- 
stranded (4-3-2-1-5-6-7) B-sheet, 11 a-helices (al-a11) and a hairpin. 
The C-terminal domain VI is a compact a-helical bundle. 

In the crystal, two ChlH subunits are related by a non-crystallo- 
graphic twofold axis (Fig. 2a). The dimerization interface occurs 
between domains I and V and has an area of 1,808 A’. Polar inter- 
actions including six hydrogen bonds and a salt bridge stabilize the 
dimer (Fig. 2b). Apart from the polar interactions, no other 


Table 1 | Data collection, phasing and refinement statistics 


Native ChIH SeMet ChIH 

Data collection 
Space group H3 H3 
Cell dimensions 

a, b, c (A) 319.7, 319.7, 105.2 321.1, 321.1, 105.2 

a By) 90, 90, 120 90, 90, 120 
Wavelength (A) 0.9793 0.9793 
Resolution (A) 50.0-2.5 (2.59-2.50) 50.0-3.0 (3.11-3.0) 
Ramo OF Rage 0.085 (0.820) 0.202 (21) 
I/ol 16.7 (2.3) 28.5 (3.6) 
Completeness (%) 99.9 (100.0) 100.0 (100.0) 
Redundancy 4.1 (4.1) 223M1) 
Refinement 

Resolution (A) 46.0-2.50 

No. of reflections 134,365 

Rewari Paes 0.179/0.209 
No. of atoms 

Protein 19,288 

Ligand/ion O 

Water 880 
B-factors 

Protein 48.6 

Ligand/ion None 

Water 47.9 
r.m.s deviations 

Bond lengths (A) 0.003 

Bond angles (°) 0.873 


Highest resolution shell is shown in parenthesis. 
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Figure 1 | Crystal structure of the ChIH subunit. a, Three surface models related by 90° rotation along the vertical axis. The head (1) domain is blue, the 
neck (IlI) domain in green, domain III in orange, the insertion (IV) domain in brown, domain V is magenta, and domain VI is purple. b, Schematic 
representation of Synechocystis ChlH subunit. Regions not visible in the crystal structure include residues 140-156, 221-229, 327-330, 380-391, 415-424 and 
630-640. The boundary between domains | and II that cannot be defined (residues 221-229) is grey. c, Domains I-VI as ribbon diagrams. 


Figure 2 | Dimer in crystal. a, Two orthogonal views of the ChIH subunit dimer shown as a ribbon diagram. One monomer is grey and the other is coloured 
as in Fig. 1. b, The dimerization interface. Residues involved in polar interactions (dashed lines) are shown as sticks. Hydrogen bonds are between the 
following six pairs: Phe2-Glu1025, Lys6-Asp1064, Thr8-Glu1062, Glu15-Asn1134, Arg43-Asp1136, Arg47-Ser1135; the salt bridge is between Glu199 and Lys1018. 
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Figure 3 | The internal pocket. a, Ribbon diagram of the ChIH subunit with the internal pocket shown as a black surface. The boxes denote areas in d-f. 

b, Open-book representation of the interface between domains Ill (orange) and V (magenta). A tetrapyrrole ring is inside the pocket and shown as a black 
stick. c, Open-book representation of the domain IIl-V interface colour coded according to amino acid residue conservation. Identical residues are marine and 
similar residues are cyan. d-f, Close-ups of areas in a. Two ChIH monomers are overlaid to show the flexibility of the loop regions. One monomer is grey 
and the other is coloured as in Fig. 1. g, The gunS mutation site. Upper panel shows a close-up of the area around Ala942; lower panel shows an in silico 


model of the gunS mutation. 


interaction is found, even though the two neck domains seem adja- 
cent to each other. In solution, monomer is the predominant form 
of ChlH subunit, and only a small fraction of dimer exists 
(Supplementary Fig. 2). The observation is consistent with the 
reported solution state of Thermosynechococcus elongatus ChlH 
(TeChlH)'*. The exclusive interaction between domains I and V 
suggests that deletion of domain I may eliminate the formation of 
the loosely bound ChlH dimer, and is consistent with the obser- 
vation that removal of the N-terminal 159 residues of TeChlH 
resulted in a clearly monomeric state!®. 

An internal pocket is located at the interface between domains III 
and V, and its size is large enough to engulf a tetrapyrrole ligand 
(Fig. 3a,b). Residues surrounding the pocket are Asn467, Val468, 
Leu624, Asn657, Glu660 and Ile663 from domain III, and 
Trp985, Vall041, Phe1042, Asn1097, Tyr1102, Asn1107, Tyr1125, 
Lys1129, Asp1163, Ile1167, Vall172, His1174, Tyrl175 and 
Asp1177 from domain V (Supplementary Fig. 3). Although molecu- 
lar details of ChlH-Proto interaction cannot be accurately deter- 
mined without the Proto-bound structure, the unliganded 
structure described here offers the first picture of the Proto- 
binding pocket of ChIH. Multiple sequence alignment shows that 
the most conserved residues are concentrated in domains III and 
V (Supplementary Fig. 4). Notably, residues forming this pocket 
are mostly conserved (Fig. 3c), indicating a conserved Proto- 
binding mechanism for ChIH subunits of different species. We 
attempted to locate the magnesium binding site by co-crystallization 
and soaking with magnesium, but did not observe a potential site. 

The two ChlH subunits within the dimer are nearly structurally 
identical, having a root mean squared deviation of 0.68 A for 1,201 
Ca pairs. Two regions exhibiting major structural differences are 
residues Asp464-Thr470 and Thr577-Ala593 (Fig. 3d,e). Both 
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regions are loops of domain III and contribute to the interface 
with domain V. Following the second loop, there is a conserved 
proline at position 595 whose mutation to leucine in Arabidopsis 
(Pro642—Leu) leads to the conditional chlorina (cch) phenotype”. 
Aside from the two loops of domain III, three loops near the 
pocket are so flexible that their electron density is missing 
(Fig. 3f). Two of the missing loops are in domain II (Thr380- 
Leu391 and Ser415-Ala424), and the other is in domain II 
(Lys630-Pro640 or Met627-Asn642). A mutation at a conserved 
alanine at position 990 of Arabidopsis ChlH to valine 
(Ala990—Val) causes a genomes uncoupled phenotype and is 
called gun5 (ref. 18). The counterpart gun5 mutation in 
Synechocystis ChlH is at position 942, which is at the joint 
between domains III and V (Fig. 3g). The cch and gun5 mutant 
ChlH proteins retain Proto binding ability but are catalytically inac- 
tive!?. Our X-ray structure suggests that both mutations may intro- 
duce interior spatial hindrance and hence interfere with the 
chelation reaction occurring in the pocket. Electron microscopy 
maps of the hexameric motor unit of MgCh, the ChlI—ChlD 
complex, have revealed that large conformational change of the 
ChlI-ChID complex happens on ATP binding!. It is possible that 
through formation of a holoenzyme, the motor unit triggers confor- 
mation change of the ChlH subunit and thus fulfils its role 
for catalysis. 

The intraorganellar location of the tetrapyrrole biosynthetic 
enzymes has been proposed as a key control of the flux into either 
the haem branch or the chlorophyll branch!?. ChlH displays a 
dual location in chloroplast stroma and membrane”®??. 
Protoporphyrinogen IX oxidase (PPO) specifically PPO1 in 
Arabidopsis, the enzyme producing Proto in chloroplasts, also loca- 
lizes to both stroma and membrane’*”’. Interestingly, the crystal 
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Figure 4 | Model of the PPO-ChIH complex. The two subunits of the PPO dimer are shown in pink and orange, and the two ChIH subunits are shown in 


blue and green. Fog is used as a depth cue. 


structure of tobacco PPO shows that, like ChlH, PPO forms a loosely 
bound dimer™. The tobacco PPO dimer fits well with the ChlH 
dimer without any structural adjustment (Fig. 4). In this PPO 
-ChIH model, the substrate-binding domain and the flavin 
adenine dinucleotide-binding domain of PPO are in close contact 
with domains III and V of ChlH (Supplementary Fig. 5). The 
bound PPO inhibitor which mimics its substrate is ~49 A from 
the centre of the internal pocket, or ~35 A from the entrance to 
the pocket. The distal membrane-binding domain of PPO probably 
associates with the membrane, and Proto could be channelled 
through the substrate-binding domain of PPO to the internal 
pocket of ChlH. 

Association of ChlH to the chloroplast membrane is prompted 
by the Gun4—porphyrin complexes”!”>. Gun4, a positive regulator 
of MgCh, directly binds to Proto and MgProto, the substrate and 
the product of MgCh, respectively*~”*. MgProto is then recognized 
by the subsequent methyltransferase, ChIM*°. Based on these find- 
ings, a working model focused on the substrate uptake and product 
release of MgCh is proposed (Supplementary Fig. 6). The membrane 
association of PPO, ChlH and Gund ensures a flux of Proto to the 
chlorophyll branch, although the role of their stromal fraction is 
unclear. After the chelation reaction, Gun4 and ChIM help the 
product MgProto to release from ChlH, and thus regenerate the cat- 
alytic subunit for a new catalytic cycle. 


Methods 

Expression, purification and crystallization. The chlH gene of Synechocystis sp. 
PCC 6803 was inserted into the pET-28a(+) plasmid (Novagen) between the NdeI 
and Xhol sites to generate the pET-28a—His,-ChlH plasmid. The recombinant His- 
tagged ChlH subunit was expressed in Escherichia coli BL21(DE3) cells after adding 
0.2 mM isopropyl B-p-1-thiogalactoside when the cell density reached A600 nm Of 0.6. 
After growth at 18 °C for 16-18 h, the cells were harvested by centrifuge, 
resuspended in a lysis buffer, containing 20 mM Tris-HCl, pH 7.5, 500 mM NaCl 
and 20 mM imidazole, and disrupted by sonication. The recombinant protein was 
purified from the cleared lysate using a nickel-nitrilotriacetic acid column (Qiagen) 
and then size-exclusion chromatography with a HiLoad 16/60 Superdex 200 column 
(GE Healthcare). The purified protein was concentrated by ultrafiltration in a buffer 
containing 20 mM Tris-HCl, pH 7.5, 150 mM NaCl and 2 mM dithiothreitol, 
shock-frozen in liquid nitrogen and stored at —80 °C. For crystallization, the purified 
protein was diluted to a concentration of 10 mg ml’. Both the native and 
selenomethionine-substituted crystals of the ChlH subunit were obtained using the 
vapour diffusion method at 16 °C in a mixture containing 1-2 ul of protein sample 
and an equal volume of reservoir solution (0.85 M sodium citrate, pH 7.0 and 2 mM 
dithiothreitol) taken from 1 ml of reservoir solution. The crystals qualified for X-ray 
data collection were optimized by the seeding method. Before data collection, 
crystals were harvested with nylon loops, and cryo-protected in the crystallization 
solution supplemented with 20% (v/v) glycerol. Crystals were then shock-frozen in 
liquid nitrogen for X-ray diffraction experiment. 


Data collection and structure determination. All X-ray diffraction data were 
collected at beamline BL17U of the Shanghai Synchrotron Radiation Facility at a 
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wavelength of 0.9793 A at 100 K. The data were indexed, integrated and scaled with 
HKL2000 (HKL Research Inc.). Selenium positions were determined using the 
SHELXD program*!. A preliminary model was built using AutoSol and AutoBuild 
from PHENIX*. Additional missing residues were manually added according to the 
2F,-F. and F,-F. electron density maps in Coot*’. The native ChlH subunit 
structure was determined by molecular replacement with the selenomethionine- 
substituted structure as the searching model. Automatic model building was 
performed using the ARP/wARP program™, and manual readjustments were done 
in Coot**. Further refinement was performed with PHENIX*. The overall quality of 
the structure was assessed by PROCHECK*, and in Ramachandran 97.47, 3.21 and 
0.32% of the residues were most favoured, additional allowed and disallowed regions, 
respectively. Data collection and structure refinement statistics are summarized in 
Table 1. The interaction surface area was calculated by PISA**. The protein structure 
figures were prepared using PyMOL”. 
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